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ABSTRACT 

CpG methylation in mammalian DNA is known to 
interfere with gene expression by inhibiting the 
binding of transactivators to their cognate 
sequence motifs or recruiting proteins involved in 
gene repression. An Epstein-Barr virus-encoded 
transcription factor, Zta, was the first example of a 
sequence-specific transcription factor that prefer- 
entially recognizes and selectively binds DNA 
sequence motifs with methylated CpG residues, 
reverses epigenetic silencing and activates gene 
transcription. The DNA binding domain of Zta is 
homologous to c-Fos, a member of the cellular 
AP-1 (activator protein 1) transcription factor 
family, which regulates cell proliferation and 
survival, apoptosis, transformation and oncogen- 
esis. We have identified a novel AP-1 binding site 
termed meAP-1, which contains a CpG dinucleotide. 
If methylated, meAP-1 sites are preferentially bound 
by the AP-1 heterodimer c-Jun/c-Fos in vitro and in 
cellular chromatin in vivo. In activated human 
primary B cells, c-Jun/c-Fos locates to these 
methylated elements in promoter regions of tran- 
scriptionally activated genes. Reminiscent of the 
viral Zta protein, c-Jun/c-Fos is the first identified 
cellular member of the AP-1 family of transacti- 
vators that can induce expression of genes with 
methylated, hence repressed promoters, reversing 
epigenetic silencing. 



INTRODUCTION 

AP-1 (activator protein 1) is a family of transcription 
factors involved in cell proliferation and survival, apop- 
tosis, transformation and oncogenesis. AP-1 proteins 
contain a basic leucine zipper (bZIP) domain and belong 
to three different subfamilies: Jun (c-Jun, JunB and 
JunD), Fos (c-Fos, FosB, Fra-1 and Fra-2) and ATF 
(activating transcription factor: ATFa, ATF2, ATF3, 
ATF4 and B-ATF). To bind to DNA, members of the 
different subfamiHes need to form dimers, which show 
different affinities for two classes of DNA sequences. 
They are 12-O-Tetradecanoylphorbol- 12- Acetate (TP A) 
response elements (TRE) and cycHc adenosine 
monophosphate (cAMP) response elements (CRE), 
which encompass the consensus sequences 5^-TGAC/ 
GTCA-3^ and 5^-TGACGTCA-3^ respectively. In 
general, Jun/Jun and Jun/Fos dimers show a higher 
affinity for TRE sites, whereas Jun/ATF and Fos/ATF 
dimers preferentially bind to CRE sites (1). 

Different cellular signaling pathways, referred to as 
mitogen-activated protein kinase (MAPK) cascades, 
regulate both the expression and post-translational modi- 
fications of AP-1 proteins. The prototypic AP-1 member is 
the c-Jun/c-Fos heterodimer, and both proteins are 
regulated by MAPK pathways (2). Many different extra- 
cellular stimuli, such as growth factors and cytokines, 
trigger the MAPK cascades consisting of a hierarchy of 
sequential kinases that finally activate the three main 
MAP kinases called JNK, ERK and p38. These proteins 
can phosphorylate different transcription factors, which in 
turn activate the expression of c-jun and c-fos genes. Once 
expressed, the c-Jun and c-Fos proteins form dimers 
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and bind to their target DNA sequence motifs, but they 
also need to be phosphorylated, a modification that can be 
introduced by MAPKs as well, to activate transcription of 
target genes with AP-1 binding sites in their promoters (1). 

In eukaryotic cells, transcriptional activation depends 
on chromatin structure, nucleosomal occupancy, histone 
modifications and DNA methylation, which are diverse 
processes that play important roles in transcriptional regu- 
lation. Transcriptionally repressed heterochromatin is 
characterized by the frequent occurrence of cytosine nu- 
cleotides with a methyl group at the 5^ position of the 
cytosine pyrimidine ring followed by guanine nucleotides, 
termed CpG. DNA methylation affects the transcription 
of genes in two ways. First, methylation of DNA itself 
may physically impede the binding of transcriptional 
proteins to promoters and, second, methylated DNA 
may be bound by proteins known as methyl-CpG- 
binding domain proteins (MBDs). MBD proteins then 
recruit additional proteins to the locus, such as histone 
deacetylases and other chromatin remodeling proteins 
that modify histones and promote the formation of 
compact, inactive and silenced chromatin, leading to 
gene repression (3). 

CpG methylation is indicative of transcriptional repres- 
sion, but we and others have recently found that a viral 
transcription (Zta) factor preferentially binds to CpG- 
methylated DNA sequence motifs activating epigenetically 
repressed viral promoters (4,5). The Zta protein (also 
termed BZLFl, ZEBRA, Z or EBl) is the major immedi- 
ate early transactivator of a human herpes virus, Epstein- 
Barr virus (EBV). Zta's bZIP domain is highly homolo- 
gous to cellular AP-1 members, especially c-Fos (6). In a 
systematic approach, we mapped all Zta response 
elements (ZREs) within promoters of viral genes and 
identified two classes of ZREs, which are characterized 
by two distinct binding motifs. One class encompasses 
nearly all ZREs, which bind to an AP-1 -related DNA 
motif. The second class encompasses ZREs with a CpG 
motif. Zta binds to the majority of these ZREs only when 
the cytosine in the CG dinucleotide carries a 5^-methyl 
group. Consequently, these sites were termed meZREs. 
Zta binds to meZRE motifs and dramatically induces 
viral gene expression but in a methylation-dependent 
fashion (5). This exceptional feature is essential for this 
herpes virus to escape from its latent phase of infection, 
which is governed by transcriptional silencing of the 
majority of viral genes including extensive and widespread 
CpG methylation of viral DNA (7). 

To our knowledge, only two reports indicate that 
cellular transcription factors also can target methylated 
DNA sequence motifs supporting gene transcription in 
metazoan cells in vivo. Spl/Sp3 binding is not commonly 
impaired by CpG methylation (8) but rather enhanced in 
the context of a tissue-specific promoter (9). When 
methylated, cAMP response elements (CRE) or composite 
CRE and C/EBP sequence elements generate novel 
binding sites for C/EBPa activating certain promoters 
(10), which is an unexpected finding (11). 

Herpes viruses acquire cellular genes through recombin- 
ation with host cell DNA. Cellular genes adopted and 
further modified by this class of viruses regulate apoptosis. 



control the cell cycle of infected cells and block host 
immune responses (12). Very likely, the viral BZLFl gene 
encoding Zta is derived from a cellular member of the AP-1 
family (6), has evolved to act as the critical immediate-early 
gene of EBV to allow exit from latently infected cells (13) 
and activates resting primary human B cells, the target of 
EBV infection (14). As described earher in the text, the 
bZIP domain of Zta is highly conserved when compared 
with bZIP domains of AP-1 members and mediates binding 
to CpG-methylated DNA sequence motifs. Zta might have 
inherited this peculiar feature from cellular AP- 1 members 
that could target and activate repressed cellular promoters 
encompassing CpG-methylated DNA as well. This hypoth- 
esis would substantiate a previously unknown type of gene 
regulation and constitute a new example of a cellular tran- 
scription factor capable of activating gene expression from 
epigenetically repressed promoters through direct binding 
to methylated DNA. 

MATERIALS AND METHODS 

Cells 

HEK293, Raji and K562 cells were maintained in RPMI 
1640 medium with 10% fetal calf serum (PCS), 1% peni- 
cillin-streptomycin and 1% sodium pyruvate at 37°C and 
5% CO2. Wi-38 cells were maintained in Dulbecco's 
Modified Eagle Medium (DMEM) medium with 10% 
PCS, 1% penicilHn-streptomycin and 1% non-essential 
amino acids at 37° C and 5% CO2. Adenoid B cells were 
maintained in RPMI 1640 medium supplemented with 
10% PCS, 100|ig/ml streptomycin, lOOU/ml penicillin, 
lOOnM sodium selenite and 1 |ig/ml cyclosporin A at 
37°C and 5% CO2. Por cellular activation, B cells were 
cultured on irradiated CD40L feeder cells in the presence 
of 2ng/ml IL-4 for 5 days as described (15). 

Plasmids 

The wild-type maxi-EBV plasmid (p2089) contains the 
complete genome of the EBV prototype B95.8 strain and 
has been described (16). The DNA binding and dimeriza- 
tion domains of human c-Jun and c-Pos (amino acid 
residues 123-331 and 1-206, respectively), separated by 
a flexible linker (804)5 (1'7), were cloned downstream of 
the coding region of enhanced green fluorescence protein 
(eGPP) in pEGPP-Cl (Clontech), to generate the expres- 
sion plasmid p4271, or downstream of the tandem StrepII/ 
PLAG-tag (18) to yield the plasmid p4548. The plasmid 
expressing VP16:c-jun/c-fos (p4316) was generated by 
replacing the gfp gene by the VP 16 transactivation 
domain (TAD) of the VP 16 protein from HSV-1 (amino 
acid residues 404-490). Luciferase plasmids were con- 
structed by inserting pentamers of single me AP-1 sites 
into a basic luciferase reporter plasmid with a minimal 
EFla promoter. The plasmid backbone of this reporter 
plasmid is free of CpGs (19). 

DNA transfection 

Transfection of DNA into HEK293 cells was performed 
using polyethylenimine (Sigma-Aldrich) as previously 



Nucleic Acids Research, 2014, Vol. 42, No. 5 3061 



described (14). For protein extracts, 1 x 10^ cells per 130- 
mm dish were seeded the day before transfection. Each 
plate was transfected with 30 |ig of plasmid DNA. For 
reporter assays, 5x10^ HEK293 cells were seeded into 
6-well plates the day before transfection. Each well was 
CO transfected with 0.5 |ig of reporter plasmid together with 
1 jig of transactivator and 0.01 |ig of DNA of a renilla- 
expressing plasmid as an internal control for data 
normalization. 

In vitro DNA methylation 

CpG methylation in vitro was performed with the de novo 
methyltransferase M.Sssl and S-adenosyl methionine as 
described (5,20). 

In vitro immunoprecipitation assays with GFP:c-Jun/c-Fos 

For the in vitro pull-down assays, the plasmid p4271 
encoding gfp:c-jun/c-fos was transiently transfected into 
HEK293 cells. Nuclear extracts containing the GFP:c- 
Jun/c-Fos protein were used in in vitro immunopre- 
cipitation assays with Escherichia c6>//-derived genomic 
EBV DNA as described (5). 

Recombinant adenovirus generation and cell infection 

The adenoviral expression vector was created using 
the Gateway Recombination Cloning Technology 
(Invitrogen). Shortly, the gfp:c-jun/c-fos coding region 
was ampHfied by polymerase chain reaction (PGR), 
introduced in the PCR8/GW/TOPO vector, and then 
transferred to the pDEST12.2 vector. The latter plasmid 
was used to recombine the gfp:c-jun/c-fos cassette into the 
adenoviral vector pCAGAdDu-GFP (provided by Dr 
Vigo Heissmeyer). 

Stable transfection and establishment of cell lines 

Five micrograms of p4271 DNA was transfected into 
5x10^ Raji cells by electroporation in 250 |il medium at 
230 V and 975 |iF using a Biorad electroporation appar- 
atus in 4-mm cuvettes. Immediately after electroporation, 
400 |il of FCS was added to the cells, which were 
transferred to a flask containing 5 ml of cell culture 
medium. Cells were incubated at 37° C and 5% CO2 for 
one day. The cells were plated into 96-well plates in 200 |il 
of cell culture medium/well, supplemented with puro- 
mycin (1.2|ig/ml) until resistant cells grew out, which 
were further cultivated under selective pressure. 

Native chromatin immunoprecipitations 

A total of 5 X 10^ Raji cells expressing the GFP:c-Jun/ 
c-Fos protein were used for chromatin immunopre- 
cipitation (ChIP) experiments, which were performed as 
described previously (5). 

Electromobility shift assays 

Electromobility shift assays (EMSAs) were performed as 
described previously [Bergbauer et al. (5)]. 

Enzyme-linked immunosorbent assay 

Enzyme-linked immunosorbent assays (ELISAs) were 
performed with purified protein from HEK293 cells 



transiently transfected with Strep/FLAG:c-Jun/c-Fos 
(p4548). Two days post-transfection, cells from three 
130-mm dishes were pooled and lysed in 2 ml of RIPA- 
buffer (50 mM Tris, 150mM NaCl, 1% NP40, 0.5% 
DOC, 0.1% SDS, pH 8.0). Cell lysates were sonicated 
and Strep/FLAG:c-Jun/c-Fos was affinity purified with 
Strep-Tactin sepharose (iba-biotagnology). Strep/ 
FLAG:c-Jun/c-Fos was eluted in 500 |il of Strep elution 
buffer. Purified Strep/FLAG:c-Jun/c-Fos protein was 
coated on a 96-well ELISA plate at lOOnM in phos- 
phate-buffered saline at 4°C overnight. Cy5-labeled oligo- 
nucleotides were provided by Metabion. After washing 
and blocking with 5% bovine serum albumin, 100 |il of 
serial dilutions of annealed double-stranded Cy5-labeled 
oligos (in 20 mM Hepes, 75 mM NaCl, 1 mM DTT, 1% 
glycerin, 2mM MgCl2, 0.01 mg/ml bovine serum albumin 
and a 20x molar excess of polydldC) were then added and 
incubated for 2h at room temperature. Unbound oHgo 
was removed with repeated washes in phosphate- 
buffered saline. Finally, the Cy5 levels/well were 
measured in the phosphoimager (FLA 5100, Fuji). The 
obtained data were fit to the Hill equation with single 
site-specific binding using the Prism 5 software to calculate 
the dissociation constant. 

Lueiferase reporter assays 

Forty-eight hours post-transfection, the HEK293 cells 
were analyzed with the Dual-Luciferase Reporter Assay 
System (Promega). Lueiferase activity was measured in a 
96-well micr opiate luminometer (Victor^, Wallac). 

Bisulfite sequencing 

Bisulfite modification was done using the EpiTect Bisulfite 
Kit (Qiagen), and the regions of interest were amplified by 
suitably designed PGR primer pairs. Primer sequences are 
available on request. 

Library construction 

Sequence libraries were constructed with paired-end DNA 
sample preparation kits (Illumina) according to the manu- 
facturer's recommendations with minor modifications and 
as described previously (5). In vitro immunoprecipitations 
with non-CpG-methylated and fully CpG-methylated 
EBV DNAs were converted into the libraries 
'4271.unmethylated' and '4271. methylated', respectively. 
DNA obtained with ChIP of Raji cell chromatin was con- 
verted into the library '4427. Raji'. Wi-38 DNA used for 
Chip and ChIP DNA were converted into the libraries 
'4673.Wi38.Input' and '4673.Wi38.ChIP', respectively. 

Read mapping 

Sequencing reactions were performed on an Illumina 
Genome Analyzer IIx or Illumina HiSeq 2000 machines. 
We generated 32 milHon and 31 million 36-bp single- 
end reads for Hbraries '4271.unmethylated' and 
'4271. methylated', respectively, 50 milHon 36-bp single- 
end reads for '4427. Raji' and 50 and 47 milHon 100-bp 
pair-end reads for the '4673. Wi38. Input' and 
'4673. Wi38. ChIP' samples, respectively. Read mapping 
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to the reference sequence and subsequent assembly was 
performed using the resequencing software Burrows- 
Wheeler AHgnment (BWA) (vO.7.1; 21). We used a com- 
bination of the recombinant EBV strain 2089 (16) and the 
bovine genome (build bosTau4) as reference sequences for 
the two libraries '4271.unmethylated' and '4271. 
methylated'. Calf thymus DNA was used as an unspecific 
competitor in the in vitro immunoprecipitation assays with 
GFP:c-Jun/c-Fos. The reference sequence for the in vivo 
samples '4427.Raji', '4673.Wi38.Input' and '4673. 
Wi38.ChIP' was the human reference sequence hgl9. 

Transcription factor binding site detection 

Genomic regions with a read depth above background 
level are considered as regions containing transcription 
factor binding sites. We used two different programs, 
Site Identification from Short Sequence Reads (SISSRs) 
(22) and QuEST (23), to identify these regions. 
Transcription factor binding sites are typically shorter 
than the sequenced DNA fragments. Binding sites 
for the sample Hbraries '4271.unmethylated' and 
'4271. methylated' were determined using SISSRs 
standard parameters, except for E (min number of direc- 
tional reads), which was set to 3200 for the library 
'4271.unmethylated' and to 1600 for '4271. methylated', 
and F (average length of DNA fragments), which was 
set to 200 in both cases. Binding sites for the sample 
'4427. Raji' were inferred using the default QuEST param- 
eters, except for kernel density estimator and read count 
threshold (th), which were set to 30 and 160, respectively. 
In the case of the sample '4673.Wi38.ChIP', it was 
analyzed by SISSRs with the results from'4673. 
Wi38. Input' as background file and setting th at 220. To 
identify consensus motifs within the discovered binding 
sites, the sequences containing possible binding sites 
were extracted and used as input for the motif finding 
algorithm Multiple EM for Motif EHcitation (MEME) 
(24). 

ChlPs with an a-c-Jun antibody 

Chip experiments were conducted following standard 
protocols. Chromatin was cross-linked for 10 min 
at room temperature and incubated with the antibody 
a-c-Jun (sc-44, Santa Cruz Biotechnology). 

Quantitative real time PCR 

Quantitative real time PCR (qRT-PCR) analysis was per- 
formed with a LightCycler 480 II instrument (Roche), 
with software version 1.5.0 SP4, according to the manu- 
facturer's instructions. AmpHfications were monitored 
with the LightCycler FastStart reaction mix (SYBR 
green I; Roche). The amplification of PCR products was 
terminated after 45 cycles. The following touchdown PCR 
protocol was used: (i) initial denaturation at 95° for 10 min 
and (ii) 45 cycles of 95° (2 s), 65°-62° (0.6° decrement in 
annealing temperature/cycle; 10 s), 72° (10 s). For all 
primer pairs, a standard curve from serial dilutions of 
the PCR product with known concentration or the input 
DNA was assessed. Primer sequences are available on 
request. 



B cell isolation 

Human primary B cells from adenoids were prepared 
and naive B cells were sorted as described previously 
(14,25). Anonymized adenoid samples from routine 
adenoidectomies were provided by the Department of 
Otorhinolaryngology, Klinikum Grosshadern, Ludwig- 
Maximilians-University of Munich. The institutional 
review board, Ethikkommission of the Klinikum 
Grosshadern, approved the study and did not require 
prior informed patient consent. 

Bioinformatic identification of meAP-1 sites in primary 
human B cells 

In the reference human genome assembly GRCh37/hgl9, 
we detected AP-1 and meAP-1 binding sites using the search 
algorithm Find Individual Motif Occurrences (FIMO) (26) 
based on our MEME output motifs. Called sites were 
mapped to promoter sequences of RefSeq genes contained 
in the UCSC Genome Browser — 5 kb to +1 kb of their tran- 
scription start sites (TSS) (27). Predicted meAP-1 binding 
site within promoter sequences were further evaluated for 
CpG methylation in naive B cells (28) (C04M7ACXX_XF, 
D088DACXX_XF, D08E5ACXX_XF). For further 
analysis, we selected only meAP-1 binding sites that show 
at least 80% methylation in two of three repHcates. To 
identify genes upregulated in activated B cells, we used ex- 
pression data obtained from unstimulated (GSM907730, 
GSM907738, GSM907746) and stimulated (GSM907731, 
GSM907739, GSM907747) B cell data sets available on 
Gene Expression Omnibus (GSE36975) (29). We selected 
transcripts that were significantly expressed in 5 of 6 
samples. Transcripts showing significant differential ex- 
pression between unstimulated and stimulated B cells 
were identified performing a one way ANOVA test using 
MATLAB. 



RESULTS 

c-Jun/c-Fos binding motifs in genomic EBV DNA 

We started our search with a synthetic model, which 
builds on the entire DNA genome of EBV cloned in 
E. coli as a single plasmid (16). This DNA is large 
(~180kb), GC-rich (60%), free of methylated cytosine nu- 
cleotides (which can be easily methylated in vitro), 
contains ~85 genes and diverse promoter elements and 
has been used previously for the analysis of Zta's DNA 
binding characteristics (5). We predicted that the DNA's 
complexity and size is suitable to identify putative DNA 
motifs, which are targeted by AP-1 if methylated. The 
second component in our model is a single-chain c-Jun/ 
c-Fos protein designed to ensure the formation of a single, 
defined AP-1 heterodimer unaffected by other proteins 
that complex with either c-Jun or c-Fos. Bakiri et al. 
(17) showed that similar constructs with the murine se- 
quences of c-Jun and c-Fos specifically bind and 
transactivate cellular promoters that contain the consen- 
sus AP-1 binding site. 

We used in vitro pull-down experiments as described (5) 
with randomly sheared non-CpG-methylated or fully 
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CpG-methylated EBV DNA and a GFP:c-Jun/c-Fos 
fusion protein purified from HEK293 cells. 
Immunoprecipitation of the fusion protein followed by 
deep sequencing of the enriched DNA fragments 
provided an extensive coverage of DNA sequences select- 
ively bound by the AP-1 heterodimer (Supplementary 
Figure SI). A comparison of fully CpG-methylated 
versus non-methylated-DNA identified several regions 
preferentially bound by c-Jun/c-Fos if CpG-methylated 
(Supplementary Figure SI, bottom), three of which were 
located at EBV promoters: BFLF2, BBLF4 and BALFl 
(Supplementary Figure S2A). Algorithms suitable to 
identify sequences of potential binding sites readily 
found the known palindromic AP-1 consensus binding 
site 5^-TGACTCA-3^ (Figure 1) in samples of 
unmethylated as well as methylated DNA, confirming 
the vaHdity of our approach. Putative AP-1 binding 
motifs identified in CpG-methylated DNA were grouped 
according to whether they encompassed CpG pairs or not. 
The reanalysis revealed a second, novel motif, 5^-TGACG/ 
TCG-3^ which is closely related to the consensus AP-1 but 
may contain up to two CpG dinucleotides. We termed this 
motif 'meAP-l' in analogy to meZRE motifs bound by 
Zta (5). Lists of the identified AP-1 binding sites are 
provided in Supplementary Tables S1-S3. 

To determine whether the biochemically identified 
meAP-1 sites in EBV DNA are also bound by c-Jun/ 
c-Fos in cells in vivo, we engineered a derivative of the 
latently EBV-infected cell line Raji with a constitutively 
expressed GFP:c-Jun/c-Fos chimera. ChIP followed by 
deep sequencing analysis (ChlP-seq) of DNA fragments 
specifically bound to GFP:c-Jun/c-Fos protein was per- 
formed, mapped to the EBV genome and analyzed 
bioinformatically in conjunction with methylation data 
(7). Confirming our results in vitro, the analyzed pro- 
moters in Raji cells were only enriched if the meAP-1 
sites were highly CpG-methylated (Supplementary 
Figure S2). 

c-Jun/c-Fos binds CpG-methylated meAP-1 sites with 
high affinity mediating transcriptional activation 

The different meAP-1 motifs identified in the EBV genome 
may contain up to two CpG pairs (Figure 1), but the three 
more prominent meAP-1 sites (Supplementary Figure SI), 
located in three EBV promoters, encompass the sequence 
5^-TGACTCG-3^ with one CpG pair, only (Figure 2A). 
We addressed this uncertainty together with the question 
of whether 5^-methylcytosine residues present in the top, 
bottom or both strands of the meAP-1 DNA sequence 
affect the affinity of c-Jun/c-Fos binding to DNA. 

Unmethylated or CpG-methylated synthetic oligo- 
nucleotides containing the meAP-1 binding sites in the 
BBLF4, BFLF2 and BALFl promoters (Figure 2A) and 
three promoters identified later in ChlP-seq experiments 
with Raji cell chromatin (Supplementary Figure S3) were 
analyzed in EMSAs. As shown in Figure 2B, c-Jun/c-Fos 
bound almost exclusively to the CpG-methylated meAP-1 
motif 5^-TGACTCG-3^ that contains a single CpG pair, 
but methylation of a CpG pair in a more central position 
of the motif decreased the binding of c-Jun/c-Fos 



unmethylated methylated 




-1 123456789 -1 1234567 

AP-1 (35 Sites) meAP-1 (9 sites) 



Figure 1. Bioinformatic identification of c-Jun/c-Fos binding motifs in 
the EBV genome. Deep sequencing data were analyzed with the SISSRs 
algorithms for putative stretches of DNA, which c-Jun/c-Fos binds. 
The output files of SISSRs were used as training sets for MEME, 
which identifies gapless, local and multiple sequence motifs. A total 
of 61 motifs were identified in the DNA free of CpG methylation 
and 44 motifs were identified in the fully CpG-methylated EBV 
DNA. The identified motifs in methylated DNA were selected at the 
level of the SISSRs training set data and grouped into motifs with and 
without CpG dinucleotides followed by MEME analysis. 



(Supplementary Figure S3B). Therefore, we concentrated 
our analysis on meAP-1 binding sites with a CpG pair at 
the distal position. The consensus meAP-1 site consists of 
the sequence motif 5^-TGACTCG-3^ and only the methyl 
group in 5^ position of the cytosine in the bottom strand is 
necessary for the methylation-specific binding of c-Jun/c- 
Fos to DNA (Figure 2C). 

To quantify DNA binding, the apparent dissociation 
constants (KD) of c-Jun/c-Fos were measured with 
unmethylated or methylated meAP-1 binding sites. The 
KD values of BBLF4, BALFl and BFLF2 meAP-1 sites 
were lower by a factor of 38.7, 10.0 and 171, respectively, 
if methylated, indicating that the presence of 
5^-methylcytosine increases the binding affinity of c-Jun/ 
c-Fos to DNA (Figure 2D). The consensus AP-1 binding 
site from the human 'collagenase' promoter (TREcollag) 
with a pubHshed KD for c-Jun/c-Fos of 10.8 nM (30) was 
used as a positive control. 

The transactivation potential of the c-Jun/c-Fos 
heterodimer was investigated in luciferase reporter 
assays. Reporter plasmids containing pentamers of the 
three selected meAP-1 sites were transiently transfected 
into HEK293 cells together with an expression plasmid 
encoding VP16:c-Jun/c-Fos, in which the VP16 
transactivator domain was fused to the single-chain AP- 
1 heterodimer. VP16:c-Jun/c-Fos induced the CpG- 
methylated reporter plasmids by a factor of 2-5 as 
compared with their non-methylated controls (Figure 2E). 

The human genome contains meAP-1 binding sites 

Two different cell Hnes were used to identify AP-1 and 
meAP-1 binding sites in vivo, the Burkitt lymphoma B- 
cell Hne Raji and the human fibroblast cell Hne Wi-38. 
As explained earlier in the text, a Raji cell line expressing 
the GFP:c-Jun/c-Fos chimera was used in ChlP-seq 
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Figure 2. The c-Jun/c-Fos binds preferentially to and activates gene transcription through CpG-methylated meAP-1 motifs. (A) Oligonucleotides 
containing meAP-1 binding sites identified in the viral BBLF4, BALFl and BFLF2 promoters. The TREcollag oHgo containing a consensus AP-1 
binding site was used as a control for the KD measurements. The meAP-1 sites and AP-1 site are indicated in bold capital letters, and the CpG pairs 
within the meAP-1 sites are underhned. (B) Preferential binding of c-Jun/c-Fos to methylated meAP-1 sites from the EBV genome. EMSAs were 
performed with affinity-purified GFP-tagged c-Jun/c-Fos fusion protein transiently expressed in HEK293 cells. Supershifts with a GFP antibody 
confirmed the identity of the protein-DNA complexes. EMSAs of typical experiments are shown as examples. (C) Only one methyl group is 

(continued) 
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experiments. In a second experimental setting, serum- 
starved Wi-38 cells were infected with a GFP:c-Jun/ 
c-Fos-expressing adenovirus. Chromatin from Wi-38 
(two days post infection) and Raji cells was prepared, 
immunoprecipitated with an a-GFP antibody to enrich 
DNA fragments specifically bound by GFP:c-Jun/c-Fos 
protein and analyzed with next-generation sequencing 
(Figure 3). 

In accordance with our first functional analysis shown 
in Supplementary Figure S3B and discussed earher in the 
text, we adjusted the bioinformatic parameters and refined 
the characteristics of the two AP-1 motifs found previ- 
ously: consensus AP-1 motifs lacking a CpG dinucleotide 
and putative meAP-1 motifs with a single distal CpG pair 
in positions six and seven (Figure 3 A and B). The two 
predominant motifs revealed in both cellular models 
confirm the known AP-1 consensus (5^-TGACTCA-30 
and identified an infrequent second meAP-1 motif 
(5^-TGAC/GTCG-30 (Figure 3A and B). Supplementary 
Tables S4 through S7 provide the location of the AP-1 and 
meAP-1 sites in both cell types. 

The majority of all consensus AP-1 sites are located in 
intergenic regions (31,32) with no assigned functions. To 
concentrate our further analysis on meAP-1 sites in prox- 
imity of promoter elements, meAP-1 sites located within 
5 kb upstream of annotated TSS were considered only if 
no other conventional AP-1 was located within a window 
of ± 1 kb. Only a small number of cellular genes fulfilled 
our criteria. From ChlP-seq experiments of Raji cells, we 
included the promoters of three genes in our study, PTK6, 
P2RY5 and SUV39H1. In Wi-38 cells, we concentrated on 
two genes, CBR3 and LRRTM2. A positive control, the 
collagenase promoter MM PI containing two AP-1 
binding sites (33) was also included. 

Visualization of the coverage of these promoters after 
ChlP-seq analysis indicated that they are differentially 
bound by c-Jun/c-Fos in the two cell types (Figure 3C), 
suggesting that the extent of cytosine methylation might 
also differ. Bisulfite sequencing of the five promoters 
revealed only one promoter element, P2RY5, that was 
fully CpG-methylated in both cell lines, but CpG pairs 
in other promoters with meAP-1 sites differed in a cell 
type-specific manner (Supplementary Figure S4). 

5 -methylcytosine-dependent binding of c-Jun/c-Fos to 
meAP-1 sites in the human genome induces 
transcriptional activation 

Only 5% of meAP-1 sites identified in both cell types are 
located in putative promoter regions, and therefore 



presumably involved in directly regulating gene expres- 
sion. Three candidate meAP-1 motifs with no AP-1 con- 
sensus binding sites in their close proximity were analyzed 
further: the meAP-1 sites in the PTK6, P2RY5 and 
SUV39H1 promoters (Figure 4). As shown in Figure 4B, 
c-Jun/c-Fos bound almost exclusively to the CpG- 
methylated meAP-1 motif. In ELISAs, the apparent dis- 
sociation constants (KD) of PTK6 and P2RY5 meAP-1 
sites could only be determined with methylated oligo- 
nucleotides because KD values of the unmethylated 
probes were too high to be estimated (Figure 4C). The 
KD for the SUV39H1 meAP-1 site could not be 
calculated, but the results confirm that the binding 
affinity is higher when the meAP-1 site is methylated 
(Figure 4B and C). These results indicate that the 
presence of 5^-methylcytosine favors the binding of 
c-Jun/c-Fos to the meAP-1 sequences identified in the 
human genome. 

The ability of c-Jun/c-Fos to transactivate gene tran- 
scription through the binding to the methylated meAP-1 
sites from the human genome was tested in reporter 
assays. Reporter plasmids containing pentamers of 
selected meAP-1 sites in the human promoters were 
cotransfected with an expression plasmid encoding the 
transactivator VP16:c-Jun/c-Fos or a negative control 
DNA. Luciferase activities with CpG-methylated 
reporter plasmids were significantly higher as compared 
with unmethylated reporter plasmid DNAs (2.0-2.7-fold, 
Figure 4D). 

Promoters with methylated meAP-1 sites are bound by 
c-Jun/c-Fos in vivo 

So far our studies were based on the ectopic expression of 
artificial c-Jun/c-Fos dimers to identify potential meAP-1 
sites in cellular DNA. In contrast, Raha et al. (31) mapped 
the location of endogenous c-Jun and c-Fos protein in two 
sets of ChlP-seq experiments and identified >40 000 
binding sites in the established human myeloblastic cell 
line K562. To correlate their findings and ours, we per- 
formed Chip assays in this cell line with a c-Jun specific 
antibody and investigated by quantitative PCR analysis 
the enrichment of the five previously identified promoter 
elements PTK6, P2RY5, SUV39H1, CBR3 and LRRTM2 
along with negative and positive control regions (34). As 
shown in Supplementary Figure S5, the previously 
identified five promoter elements, which contain only 
meAP-1 but no consensus AP-1 sites, were enriched at 
levels similar to or exceeding those of the positive 
controls in K562 chromatin indicating that endogenous 



Figure 2. Continued 

responsible for the binding of c-Jun/c-Fos to methylated DNA. The meAP-1 and AP-1 binding sites contained in the ohgonucleotide used in EMSAs 
are shown; the asterisks represent a 5'-methylcytosine residue. EMSAs using BBLF4 and TREcollag oligonucleotides with the sites depicted in the 
left panel and GFP:c-Jun/c-Fos-purified protein are shown. One representative experiment out of three is provided. (D) Determination of the 
apparent KD of c-Jun/c-Fos bound to meAP-1. ELISAs were performed with affinity-purified Strep/FL AG-tagged c-Jun/c-Fos fusion protein 
transiently expressed in HEK293 cells. Serial dilutions of Cy5 ohgonucleotides were used to determine the KD values. The obtained data were 
fitted to the Hill equation with one site-specific binding to determine the dissociation constants, which are indicated. Each data point indicates the 
mean and standard deviation of three independent experiments. (E) Functional identification of single meAP-1 binding sites. Pentamers of single 
meAP-1 sites present in the promoters of BBLF4, BALFl and BFLF2 EBV genes as in (A) were introduced into a basic luciferase reporter plasmid 
with a minimal EFla promoter and free of CpG dinucleotides. Unmethylated and fully CpG-methylated reporter constructs were analyzed in the 
presence or absence of a cotransfected VP16:c-Jun/c-Fos expression plasmid. Each experiment was performed three times and the means and 
standard deviations are depicted. 
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Figure 3. Identification of AP-1 binding motifs in the human genome. Identification of c-Jun/c-Fos binding motifs in Raji (A) and Wi-38 (B) cells. The 
samples used for deep sequencing were prepared as indicated. Data obtained from Raji and Wi-38 cell ChlPs were analyzed with the algorithms QuEST 
and SISSRs, respectively. MEME analysis was conducted as described in detail in the legend of Figure 1. In chromatin of Raji and Wi-38 cells, 9232 and 
6547 motifs, respectively, were discovered, which were subsequently split into two groups. One group contains all motifs identified with the aid of the 
MEME algorithm harboring a CpG pair in positions 6 and 7 of the consensus motif. In all, 80 and 48 motifs were identified in Raji and Wi-38 cells, 
respectively, termed meAP-1. The remaining motifs were reanalyzed by MEME, which identified 7017 and 6499 sites that correspond to the known 
consensus AP-1 site lacking a CpG dinucleotide. (C) Visualization of selected human promoters with meAP-1 sites that bind c-Jun/c-Fos in vivo. The 
'collagenase' {MM PI) promoter served as a positive control. Depicted is the read depth (coverage) at single base pair resolution (y-axis). The boxes 
indicate the position of the transcripts. Genomic coordinates correspond to the human genome assembly GRCh37/hgl9. 
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Figure 4. Functional analysis of human meAP-1 sites. (A) Oligonucleotides containing meAP-1 binding sites of the PTK6, P2RY5 and SUV39H1 
promoters. The meAP-1 sites are indicated in bold capital letters and the CpG pairs within the meAP-1 sites are underhned. (B) EMSAs showing the 
preferential binding of c-Jun/c-Fos to the CpG-methylated meAP-1 sites of the PTK6, P2RY5 and SUV39H1 promoters. EMSAs were performed 
with affinity-purified GFP-tagged c-Jun/c-Fos fusion protein transiently expressed in HEK293 cells. EMSAs of typical experiments are shown. 
(C) Determination of the apparent KD of c-Jun/c-Fos bound to meAP-1. ELISAs were performed with affinity-purified Strep/FLAG-tagged c- 
Jun/c-Fos fusion protein transiently expressed in HEK293 cells. Serial dilutions of Cy5 ohgonucleotides were used to determine the KD values. The 
data were fitted to the Hill equation with one site-specific binding to determine the dissociation constants, which are indicated. Each data point 
indicates the mean and standard deviation of three independent experiments. (D) The c-Jun/c-Fos activation of promoters with meAP-1 binding sites 
is increased on CpG methylation. Pentamers of short ohgonucleotides shown in (A), which contain the meAP-1 sites present in the promoters of the 
human PTK6, P2RY5 and SUV39H1 genes were introduced into a basic luciferase reporter plasmid as described in Figure 2E. Unmethylated and 
fully CpG-methylated reporter constructs were analyzed in the presence or absence of a cotransfected VPld.c-Jun/c-Fos expression plasmid. Each 
experiment was performed three times and means and standard deviations are depicted. 
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AP-1 can target meAP-1 sites in the context of cellular 
chromatin. 

Next, we analyzed the state of CpG methylation in 
K562 cell DNA by bisulfite sequencing. The repetitive 
meAP-1 site within the P2RY5 promoter was fully 
methylated (Supplementary Figure S6A) similar to the 
situation in Raji and Wi-38 cells (Supplementary Figure 
S4A and B). In contrast, two other promoters were only 
moderately methylated (Supplementary Figure S6A). If 
c-Jun/c-Fos preferentially bound to methylated meAP-1 
sites. Chip experiments with an a-c-Jun antibody 
should enrich methylated DNA. A clear enrichment of 
methylated CpGs could be detected in the two promoters 
of the PTK6 and LRRTM2 genes sequenced after bisulfite 
treatment (Supplementary Figure S6B). The single func- 
tionally identified meAP-1 binding site in the PTK6 
promoter in Raji cell chromatin (Figure 3C) was 
enriched 3-fold after ChIP in K562 cells, whereas the re- 
maining three potential meAP-1 sites in this promoter, 
which were only predicted with the aid of the search algo- 
rithm FIMO (26), could not be confirmed in this func- 
tional analysis (Supplementary Figure S6B), supporting 
the vaHdity of our approach. 

In vivo^ c-Jun binds meAP-1 motifs in promoters of genes 
upregulated in activated primary human B cells 

Our detailed characterization of the newly discovered 
meAP-1 motif provided evidence that c-Jun/c-Fos binds 
these sequences in vivo. To substantiate our findings, we 
wanted to assess if meAP-1 sites are also capable of 
conferring transcriptional regulation in the context of 
cellular chromatin in a relevant model. AP-1 signaling 
cascades play an important role in physiological B cell 
activation and differentiation. Comprehensive data sets 
are publicly available that address key aspects of the 
biology of primary lymphocytes comparing genes 
expressed in resting and activated naive human B cells 



(29). In addition, the methylome of naive human B cells 
has been determined recently (28). 

Individual AP-1 and me API sites recruit the 
heterodimer c-Jun/c-Fos in a highly cell specific manner 
as exemplified in Figure 3C comparing Raji B cells and 
Wi-38 fibroblasts. Therefore, we turned to the search al- 
gorithm FIMO (26) to detect potential AP-1 and meAP-1 
binding sites in the reference human genome assembly 
GRCh37/hgl9 (35) (Supplementary Tables S8 and S9, re- 
spectively). Next, we identified those genes in human 
DNA that (i) contain potential meAP-1 binding sites 
within their promoter regions, from — 5kb to +lkb 
relative to the TSS, and (ii) were upregulated by a factor 
of two or more comparing activated with resting human 
naive B cells 16 h post induction with CD40 ligand, CpG 
stimulation and B cell receptor cross-linking (29). Finally, 
we excluded from this Hst those meAP-1 sites in which the 
degree of cytosine methylation of the CpG pair was <80% 
in naive B cells (28). Applying these bioinformatic selec- 
tion criteria, we identified ~280 genes (Supplementary 
Table SIO), which were upregulated in activated B cells 
(29) and contain methylated meAP-1 sites in their pro- 
moters. To focus our attention on the most promising 
candidates, we picked 10 genes with presumed functional 
meAP-1 sites located within the first 2kb upstream of the 
TSS that are at least 1 kb apart from a consensus AP-1 
site, if any. 

Table 1 Hsts the selected genes that fulfill all these 
stringent criteria and contain the consensus meAP-1 
motif 5^-TGACTCG-3^ Next, we isolated naive primary 
human B cells, which were activated with CD40-ligand 
and IL-4 for 5 days to induce their proliferation and 
activate cellular AP-1 pathways. ChIP assays with a 
c-Jun-specific antibody were performed and the enrich- 
ment of the in silico selected promoter elements was 
assessed by quantitative PCR analysis. As shown in 
Figure 5, the 10 promoter elements were enriched at 
levels similar to those of the positive controls in activated 
B cells. The selected meAP-1 sites were confirmed to be 



Table 1. Selection criteria and experimental results of promoters of meAP-1 -regulated genes in naive human B cells and the cell Hne K562 



Gene name 


Fold 

upregulation^ 


CpG 

methylation^ (%) 


Distance meAP-1 
to TSS (nt) 


Distance AP-1 
to TSS (nt) 


Chip in primary 
B cells" 


BS-seq^ (%) 


Chip in 
K562 ceUs" 


DCTPPl 


14.03 


>98 


-678 


n.a. 


2.5 


>98 


3.0 


SLAMFl 


6.13 


100 


-1698 


n.a. 


2.6 


>90 


4.8 


UQCRB 


3.89 


>90 


-1244 


n.a. 


3.0 


>98 


4.0 


SCRIB 


3.04 


>80 


-1625 


n.a. 


3.3 


>90 


3.6 


VATl 


2.01 


>90 


-1735 


n.a. 


2.7 


>98 


4.2 


SAEl 


5.79 


>90 


-718 


-2831 


2.3 


>98 


5.3 


PSMCl^ 


4.69 


>80 


-1418 


-2532 


3.0 


100 


6.0 






>90 


-929 




2.6 


n.d. 


4.1 


ITFG2 


3.09 


>90 


-1007 


-4336 


2.7 


>90 


4.4 


LARP4B 


2.85 


>80 


-1272 


-4707 


2.6 


>98% 


3.6 


FOR 


2.06 


>90 


-744 


-2004 


2.5 


>90% 


7.0 



n.a., not applicable; n.d., not done. 

^Transcriptional activation in naive B cells 16 h post stimulation (29). 
^methylome data from resting naive B cells (28). 

'^data derived from Figure 5, fold enrichment compared with the mean of negative controls. 
^BS-seq; bisulfite sequencing; data derived from Supplementary Figure S8. 

"Data derived from Supplementary Figure S7, fold enrichment compared with the mean of negative controls. 
^PSMCl promoter encompasses two meAPl and one AP-1 site. 
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Figure 5. In vivo binding of c-Jun/c-Fos to cellular promoters with 
meAP-1 sites in activated primary human B cells. After immunopre- 
cipitation (ChIP) of chromatin from activated naive B cells with an a-c- 
Jun antibody, the enrichment of the indicated promoter regions was 
assayed by quantitative PCR. Shown are the results obtained from 
three promoter regions indicated in red that contain conventional pre- 
viously identified AP-1 sites (34) and three promoter regions indicated 
in black, which were identified in this study and are devoid of conven- 
tional AP-1 or meAP-1 sites. Both sets of promoters served as positive 
and negative controls, respectively. Ten promoters that were found to 
contain meAP-1 sites (and lack conventional AP-1 sites in proximity) 
were analyzed by ChIP and the results are shown in green. The PSMCl 
promoter contains two separate regions with one meAP-1 site each 
depicted as PSMCl -1 and -2, which were analyzed separately. Details 
of all promoters investigated here are hsted in Table 1. Means and 
standard deviations of three independent experiments are shown. 



methylated by analyzing the state of CpG methylation 
in genomic DNA of non-activated B cells by bisulfite 
sequencing (Table 1 and Supplementary Figure S8). 
Supporting our finding in primary B cells, the 10 
promoter regions were also bound by c-Jun in the K562 
cell Hne (Supplementary Figure S7). 

Table 1 summarizes the original data that led to the in 
silico identification of the 10 genes and the experimental 
results obtained in this work. Our data indicate that the 
predicted meAP-1 binding sites are CpG-methylated and 
targeted by c-Jun in activated B cells in vivo, suggesting 
that individual meAP-1 sites contribute to transcriptional 
activation of highly CpG-methylated, hence epigenetically 
repressed genes in naive B lymphocytes on their activa- 
tion. One caveat remains that cannot be addressed experi- 
mentally: mitogenic physiological signals in B cell 
proHferation and development-Hke B cell receptor cross- 
linking, CD40 engagement and interleukin receptor 
signals induce the MAPK cascades leading to AP-1 acti- 
vation but also engage additional signaHng cascades that 
might also act on meAP-1 -regulated genes assisting their 
transcriptional reactivation (36). 



DISCUSSION 

The aforementioned results confirm our starting hypoth- 
esis and indicate that AP- 1 has the capacity to bind a class 
of previously unknown discrete DNA sequence motifs 



with methylated cytosine residues in vivo. It thus appears 
that Zta is not only a closely related member of the large 
AP-1 family because it shares a highly conserved DNA 
binding domain with c-Fos (6), but also may have 
evolved from a predecessor of the AP-1 protein family, 
which can specifically make contacts with the 5^-methyl 
group of cytosine nucleotides. We propose that additional 
members of the vast AP-1 family can also bind to 
methylated DNA in a sequence-specific manner. 

A close inspection of ZRE/meZRE and AP-1 /meAP-1 
pairs (Figure 6) suggests that they are related but have 
evolved differently, (i) In general, nucleotide transitions 
(but not transversions) cause the differences within the 
ZRE/meZRE and AP-1 /meAP-1 pairs (Figure 6A) reflect- 
ing only minor differences in the molecular structures of 
thymine and 5^-methylcytosine (Figure 6B). (ii) The 
position of the methylated cytosine in the consensus se- 
quences of meZRE and meAP-1 is offset by one nucleotide 
(Figure 6A). (iii) The consensus AP-1 motif is almost pal- 
indromic but the meAP-1 motif is not, suggesting that 
either c-Jun or c-Fos preferentially make contact with 
the 5^-methylcytosine group. This view is supported by 
two detailed reports indicating that c-Fos and c-Jun are 
unequal siblings (37,38). Interestingly, Risse et al. (38) 
delineated the importance of individual thymine residues 
in the AP-1 consensus sequence and found that all 
thymines including the two distal positions crucially con- 
tribute to DNA-protein interactions. 

Zta is the first example of a sequence-specific transcrip- 
tion factor, which preferentially recognizes and selectively 
binds DNA sequence motifs with methylated CpG 
residues and activates gene transcription through these 
distinct motifs (4,5). Previously, other mammalian 
factors (RXFl, RBP-J, ZBTB4) have been shown to 
bind DNA sequences encompassing 5^-methylcytosine nu- 
cleotides in vitro, only, and no evidence for methylation- 
dependent binding of these factors exists in cells in vivo 
(39-42). Retrospectively, this failure is not surprising 
because consensus meAP-1 motifs are rarely targeted by 
c-Jun/c-Fos in cellular chromatin. Accessible and there- 
fore functional meAP-1 sites account for <2% of the 
45 000 c-Jun/c-Fos binding sites that are predicted in 
human genomic DNA (Figure 3A and B; Supplementary 
Table S9). This is in contrast to the high number of con- 
sensus AP-1 sites predicted in silico (32), which are readily 
bound by factors of the AP-1 family in human chromatin 
(31,34). 

Interestingly, a recent systematic in vitro analysis of a 
large panel of transcription factors identified their 
frequent binding to sequence motifs with methylated 
CpG dinucleotides (43). The search did not identify the 
non-canonical meAP-1 motif reported here. Rather, it 
revealed distinct and therefore unrelated motifs either 
free of CpG dinucleotides or with methylated CpGs 
targeted by the same mammalian transcription factor. 
This observation is surprising and clearly in contrast 
to our findings, which led to the discovery of the 
meAP-1 motif, a close relative of the canonical AP-1 site 
(Figure 6). 

Our results indicate that the 5^-methylcytosine residue 
can functionally substitute for the crucial distal thymine in 
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Figure 6. Sequence motifs of AP-1 and ZRE binding sites and their 
molecular features. (A) Comparison of AP-1, ZRE, meAP-1 and 
meZRE binding sites. The CpG pairs in the meAP-1 and meZRE 
motifs are underlined. (B) Structure of the DNA bases cytosine, 
5^-methylcytosine and thymine. The dashed Hues indicate the 
hydrogen bonds with the bases of the complementary DNA strand. 



one strand of the AP-1 consensus motif providing similar 
molecular structures for AP-1 binding (Figures 2C and 6). 
The structural resemblance of thymine to 
5^-methylcytosine suggests that the c-Jun/c-Fos 
heterodimer binds similarly to AP-1 and meAP-1 sites 
(39,44). As c-Jun/c-Fos heterodimers target meAP-1 sites 
in vivo (Figures 3 and 5, Supplementary Figures S5 and 
S7), the status of DNA methylation determines AP-1 
binding and consequently responsiveness to AP-1- 
mediated transcriptional activation. In this context, 
DNA methylation adds a new layer to gene regulation 
and presumably also cell type- and tissue-specific function- 
aUty of signaling pathways that activate AP-1 family 
members. 

The majority of consensus AP-1 sites that were 
identified in cellular chromatin are located in intergenic 
regions or are distant from transcriptional start sites and 
have unknown functions (31,34). We see a comparable 
distribution of meAP-1 sites. Of the 45 000 potential 
meAP-1 sites (Supplementary Table S9), only a small 
fraction resides within 5 kb upstream of transcriptional 
start sites suggesting that these meAP-1 sites might serve 
an important and direct role in regulating genes that are 
governed by epigenetic inhibition including DNA methy- 
lation. The functions of the many consensus AP-1 and 
non-canonical meAP-1 site localized in intergenic regions 
remain an enigma that needs to be addressed. 

Owing to the nature and frequency of potentially func- 
tional meAP- 1 binding sites in the cell lines and conditions 
used in this study, we had to restrict our search in estab- 
lished cell lines to promoters that are bound by c-Jun/ 
c-Fos heterodimers and contain meAP-1 sites but lack 
consensus AP-1 sites in proximity for obvious technical 
reasons. The strict filtering criteria resulted in a small set 
of promoter candidates that likely do not reflect profound 
or even decisive functions of the identified genes in human 



cells. The situation is different in primary naive B cells of 
human origin, in which we concentrated on meAP-1 in the 
promoters of genes upregulated after induced B cell acti- 
vation. The fact that AP-1 is binding these meAP-1 
binding sites suggests their having functional roles in 
metazoan gene regulation in certain cell types and 
certain situations, e.g. during the transition from quies- 
cent, resting B lymphocytes to proliferating B blasts. In 
this context, c-Jun/c-Fos differs from Zta, which fre- 
quently binds to many viral meZRE-containing promoters 
and efficiently reverses their epigenetic repression 
(5,7,14,45). These differences could be explained by the 
fact that the key functions of Zta in the EBV model are 
understood, whereas the c-Jun/c-Fos transcription factor 
might have yet to be uncovered functional roles in differ- 
ent settings. Although the functionahty of the meAP-1 
binding sites cannot be fully uncovered with the experi- 
ments presented here, the experiments document the 
in vivo existence of these elements, which are bound pref- 
erentially if their DNA motif carries a 5^-methylcytosine. 
It is likely that these elements play a critical role in the 
transcriptional activation of their target genes. These 
findings now allow the investigation of other experimental 
settings such as different cell lines or primary cell types as 
well as distinct stimuH, stress signals or developmental 
cues that activate cells, induce their immediate differenti- 
ation or trigger cell fate decisions. The goal is now to 
discover more examples of meAP-1 sites that regulate 
the expression of critical genes to unveil the full range of 
meAP-1 elements and their biological functions. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Onhne. 



ACKNOWLEDGEMENTS 

The authors thank Christine Gobel for technical advice 
with the luciferase assays, Dagmar Pich for her support 
with cell sorting and Vigo Heissmeyer for his adenoviral 
vector constructs, advice and helpful discussions. They 
also thank Reiner Siebert (Institut fiir Humangenetik, 
UniversitatskHnikum Schleswig-Holstein, Christian- 
Albrechts-Universitat zu Kiel) and Steve Hoffmann 
(Transcriptome Bioinformatics, LIFE Research 
Complex, University Leipzig) for the valuable access to 
the methylome data of primary naive B lymphocytes of 
three human donors and helpful suggestions and 
discussions. 



FUNDING 

Alexander von Humboldt Foundation [Humboldt- 
Forschungsstipendium fur Postdoktoranden to M.G.]; 
intramural grants, the Deutsche Forschungsgemeinschaft 
[SPP1230, SFB 1054, SFB 1064, SFB-TR36]; National 
Institutes of Health [CA70723]. Funding for open access 
charge: Helmholtz Zentrum Miinchen. 

Conflict of interest statement. None declared. 



Nucleic Acids Research, 2014, Vol. 42, No. 5 3071 



REFERENCES 

1. Eferl,R. and Wagner,E.F. (2003) AP-1: a double-edged sword in 
tumorigenesis. Nat. Rev. Cancer, 3, 859-868. 

2. Hess, J., Angel,P. and Schorpp-Kistner,M. (2004) AP-1 subunits: 
quarrel and harmony among siblings. /. Cell Sci., 117, 
5965-5973. 

3. Prokhortchouk,E. and Defossez,P.A. (2008) The ceU biology of 
DNA methylation in mammals. Biochim. Biophys. Acta, 1783, 
2167-2173. 

4. Bhende,P.M., Seaman,W.T., Delecluse,H.J. and Kenney,S.C. 
(2004) The EBV lytic switch protein, Z, preferentially binds to 
and activates the methylated viral genome. Nat. Genet., 36, 
1099-1104. 

5. Bergbauer,M., Kana,M., Schmeinck,A., Gobel,C., Rothbauer,U., 
Eck,S., Benet-Pages,A., Strom,T.M. and Hammerschmidt,W. 
(2010) CpG-methylation regulates a class of Epstein-Barr virus 
promoters. PLoS Pathog., 6, el001114. 

6. Farren,P.J., Rowe,D.T., Rooney,C.M. and Kouzarides,T. (1989) 
Epstein-Barr virus BZLFl trans-activator specifically binds 

to a consensus AP-1 site and is related to c-fos. EM BO J., 8, 
127-132. 

7. Woellmer,A., Arteaga-Salas,J.M. and Hammerschmidt,W. (2012) 
BZLFl governs CpG-methylated chromatin of Epstein-Barr virus 
reversing epigenetic repression. PLoS Pathog., 8, el002902. 

8. Holler,M., Westin,G., Jiricny,J. and Schaffner,W. (1988) Spl 
transcription factor binds DNA and activates transcription even 
when the binding site is CpG methylated. Genes. Dev., 2, 
1127-1135. 

9. Hantusch,B., Kalt,R., Krieger,S., Puri,C. and Kerjaschki,D. 
(2007) Spl/Sp3 and DNA-methylation contribute to basal 
transcriptional activation of human podoplanin in MG63 versus 
Saos-2 osteoblastic ceUs. BMC Mol. Biol., 8, 20. 

10. Rishi,V., Bhattacharya,P., Chatterjee,R., Rozenberg,J., Zhao,J., 
Glass,K., Fitzgerald,P. and Vinson,C. (2010) CpG methylation of 
half-CRE sequences creates C/EBPalpha binding sites that 
activate some tissue-specific genes. Proc. Natl Acad. Sci. USA, 
107, 20311-20316. 

11. Iguchi-Ariga,S.M. and Schaffner,W. (1989) CpG methylation of 
the cAMP-responsive enhancer/promoter sequence TGACGTCA 
aboHshes specific factor binding as well as transcriptional 
activation. Genes. Dev., 3, 612-619. 

12. Holzerlandt,R., Orengo,C., Kenam,P. and Alba,M.M. (2002) 
Identification of new herpesvirus gene homologs in the human 
genome. Genome Res., 12, 1739-1748. 

13. Sinclair ,A.J. (2003) bZIP proteins of human gammaherpesviruses. 
/. Gen. Virol, 84, 1941-1949. 

14. Kana,M., Schmeinck,A., Bergbauer,M., Pich,D. and 
Hammerschmidt,W. (2010) AP-1 homolog BZLFl of Epstein- 
Barr virus has two essential functions dependent on the epigenetic 
state of the viral genome. Proc. Natl Acad. Sci. USA, 107, 
850-855. 

15. Wiesner,M., Zentz,C., Mayr,C., Wimmer,R., Hammerschmidt,W., 
Zeidler,R. and Moosmann,A. (2008) Conditional immortalization 
of human B cells by CD40 ligation. PLoS One, 3, el464. 

16. Delecluse,H.J., Hilsendegen,T., Pich,D., Zeidler,R. and 
Hammerschmidt,W. (1998) Propagation and recovery of intact, 
infectious Epstein-Barr virus from prokaryotic to human cells. 
Proc. Natl Acad. Sci. USA, 95, 8245-8250. 

17. Bakiri,L., Matsuo,K., Wisniewska,M., Wagner,E.F. and Yaniv,M. 
(2002) Promoter specificity and biological activity of tethered AP- 
1 dimers. Mol. Cell. Biol, 22, 4952-4964. 

18. Gloeckner,C.J., Boldt,K., Schumacher, A., Roepman,R. and 
Ueffing,M. (2007) A novel tandem affinity purification strategy 
for the efficient isolation and characterisation of native protein 
complexes. Proteomics, 1, 4228-4234. 

19. Klug,M. and RehH,M. (2006) Functional analysis of promoter 
CpG methylation using a CpG-free luciferase reporter vector. 
Epigenetics, 1, 127-130. 

20. Kana,M., Gobel,C. and Hammerschmidt,W. (2012) The 
lytic phase of Epstein-Barr virus requires a viral genome 
with 5-methylcytosine residues in CpG sites. /. Virol, 86, 
447^58. 



21. Li,H., Ruan,J. and Durbin,R. (2008) Mapping short DNA 
sequencing reads and calhng variants using mapping quahty 
scores. Genome Res., 18, 1851-1858. 

22. Jothi,R., Cuddapah,S., Barski,A., Cui,K. and Zhao,K. (2008) 
Genome-wide identification of in vivo protein-DNA binding sites 
from ChlP-Seq data. Nucleic Acids Res., 36, 5221-5231. 

23. Valouev,A., Johnson,D.S., Sundquist,A., Medina,C., Anton,E., 
Batzoglou,S., Myers,R.M. and Sidow,A. (2008) Genome-wide 
analysis of transcription factor binding sites based on ChlP-Seq 
data. Nat. Methods, 5, 829-834. 

24. Bailey,T.L., Winiams,N., Misleh,C. and Li,W.W. (2006) MEME: 
discovering and analyzing DNA and protein sequence motifs. 
Nucleic Acids Res., 34, W369-W373. 

25. Seto,E., Moosmann,A., Gromminger,S., Walz,N., Grundhoff,A. 
and Hammerschmidt,W. (2010) Micro RNAs of Epstein-Barr 
virus promote cell cycle progression and prevent apoptosis of 
primary human B ceUs. PLoS Pathog., 6, el 00 1063. 

26. Grant,C.E., Bailey,T.L. and Noble,W.S. (2011) FIMO: 
scanning for occurrences of a given motif. Bioinformatics, 21, 
1017-1018. 

27. Meyer,L.R., Zweig,A.S., Hinrichs,A.S., Karolchik,D., Kuhn,R.M., 
Wong,M., Sloan,C.A., Rosenbloom,K.R., Roe,G., Rhead,B. et al 
(2013) The UCSC Genome Browser database: extensions and 
updates 2013. Nucleic Acids Res., 41, D64-D69. 

28. Kulis,M., Heath,S., Bibikova,M., Queiros,A.C., Navarro,A., 
Clot,G., Martinez-Trinos,A., Castenano,G., Brun-Heath,L, 
Pinyol,M. et al. (2012) Epigenomic analysis detects widespread 
gene-body DNA hypomethylation in chronic lymphocytic 
leukemia. Nat. Genet., 44, 1236-1242. 

29. Le Gallou,S., Caron,G., Delaloy,C., Rossille,D., Tarte,K. and 
Fest,T. (2012) IL-2 requirement for human plasma ceU 
generation: coupHng differentiation and proliferation by 
enhancing MAPK-ERK signaling. /. Immunol, 189, 161-173. 

30. Chytil,M., Peterson,B.R., Erlanson,D.A. and Verdine,G.L. (1998) 
The orientation of the AP-1 heterodimer on DNA strongly affects 
transcriptional potency. Proc. Natl Acad. Sci. USA, 95, 
14076-14081. 

31. Raha,D., Wang,Z., Moqtaderi,Z., Wu,L., Zhong,G., Gerstein,M., 
Struhl,K. and Snyder,M. (2010) Close association of RNA 
polymerase II and many transcription factors with Pol III genes. 
Proc. Natl Acad. ScL USA, 107, 3639-3644. 

32. Zhou,H., Zarubin,T., Ji,Z., Min,Z., Zhu,W., Downey,J.S., Lin,S. 
and Han, J. (2005) Frequency and distribution of AP-1 sites in the 
human genome. DNA Res., 12, 139-150. 

33. Benbow,U. and Brinckerhoff,C.E. (1997) The AP-1 site and 
MMP gene regulation: what is all the fuss about? Matrix Biol, 
15, 519-526. 

34. Li,M., Ge,Q., Wang,W., Wang,J. and Lu,Z. (2011) c-Jun 
binding site identification in K562 cells. /. Genet. Genomics, 38, 
235-242. 

35. Church,D.M., Schneider,V.A., Graves,T., Auger,K., 
Cunningham,F., Bouk,N., Chen,H.C., Agarwala,R., 
McLaren,W.M., Ritchie,G.R. et al. (2011) Modernizing reference 
genome assemblies. PLoS Biol, 9, el 00 1091. 

36. Richards, S., Watanabe,C., Santos,L., Craxton,A. and Clark,E.A. 
(2008) Regulation of B-cell entry into the cell cycle. Immunol 
Rev., 224, 183-200. 

37. Leonard,D.A. and Kerppola,T.K. (1998) DNA bending 
determines Fos-Jun heterodimer orientation. Nat. Struct. Biol, 5, 
877-881. 

38. Risse,G., Jooss,K., Neuberg,M., Bruller,H.J. and Muller,R. (1989) 
Asymmetrical recognition of the pahndromic API binding site 
(TRE) by Fos protein complexes. EMBO J., 8, 3825-3832. 

39. Bartels,S.J., Spruijt,C.G., Brinkman,A.B., Jansen,P.W., 
Vermeulen,M. and Stunnenberg,H.G. (2011) A SILAC-based 
screen for methyl-CpG binding proteins identifies RBP-J as a 
DNA methylation and sequence-specific binding protein. PLoS 
One, 6, e25884. 

40. Niesen,M.I., Osborne,A.R., Yang,H., Rastogi,S., Chellappan,S., 
Cheng,J.Q., Boss,J.M. and Blanck,G. (2005) Activation of a 
methylated promoter mediated by a sequence-specific DNA- 
binding protein, RFX. /. Biol. Chem., 280, 38914-38922. 

41. Zhang,X.Y., Jabrane-Ferrat,N., Asiedu,C.K., Samac,S., 
Peterlin,B.M. and Ehrlich,M. (1993) The major histocompatibility 



3072 Nucleic Acids Research, 2014, Vol. 42, No. 5 



complex class II promoter-binding protein RFX (NF-X) 
is a methylated DNA-binding protein. Mol. Cell. Biol., 13, 
6810-6818. 

42. Sasai,N., Nakao,M. and Defossez,P.A. (2010) Sequence-specific 
recognition of methylated DNA by human zinc-finger proteins. 
Nucleic Acids Res., 38, 5015-5022. 

43. Hu,S., Wan,J., Su,Y., Song,Q., Zeng,Y., Nguyen,H.N., Shin,J., 
Cox,E., Rho,H.S., Woodard,C. et al. (2013) DNA methylation 
presents distinct binding sites for human transcription factors. 
Elife, 2, e00726. 



44. Glover,J.N. and Harrison,S.C. (1995) Crystal structure of the 
heterodimeric bZIP transcription factor c-Fos-c-Jun bound to 
DNA. Nature, 373, 257-261. 

45. Karlsson,Q.H., Schelcher,C., Verrall,E., Petosa,C. and 
Sinclair,A.J. (2008) Methylated DNA recognition during the 
reversal of epigenetic silencing is regulated by cysteine and serine 
residues in the Epstein-Barr virus lytic switch protein. PLoS 
Pathog., 4, el000005. 



